Accurate Methods for the Statistics of Surprise and Coincidence

Author

  • Ted Dunning
Abstract

Much work has been done on the statistical analysis of text. In some cases reported in the literature, inappropriate statistical methods have been used, and the statistical significance of results has not been addressed. In particular, asymptotic normality assumptions have often been used unjustifiably, leading to flawed results. This assumption of a normal distribution limits the ability to analyze rare events. Unfortunately, rare events make up a large fraction of real text. However, more applicable methods based on likelihood ratio tests are available, and these yield good results with relatively small samples. These tests can be implemented efficiently and have been used for the detection of composite terms and for the determination of domain-specific terms. In some cases, these measures perform much better than the methods previously used. In cases where traditional contingency table methods work well, the likelihood ratio tests described here give nearly identical results. This paper describes the basis of a measure based on likelihood ratios which can be applied to the analysis of text.
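The abstract does not state the formula, so the following is only a minimal sketch of one common way to compute a likelihood ratio statistic (often written G²) for a 2×2 contingency table of co-occurrence counts, as might be used when testing whether two words form a composite term. The function names and the cell layout (`k11` = both words together, `k12` and `k21` = one word without the other, `k22` = neither) are assumptions made for illustration and are not taken from the paper.

```python
import math


def _x_log_x(x):
    """Return x * ln(x), treating 0 * ln(0) as 0."""
    return 0.0 if x == 0 else x * math.log(x)


def _entropy_term(*counts):
    """Return N*ln(N) - sum(k*ln(k)) for the given counts, with N = sum(counts)."""
    return _x_log_x(sum(counts)) - sum(_x_log_x(k) for k in counts)


def log_likelihood_ratio(k11, k12, k21, k22):
    """G^2 statistic for a 2x2 table of counts (hypothetical helper).

    Equivalent to 2 * sum(k_ij * ln(k_ij * N / (row_i * col_j))),
    which is 0 when rows and columns are exactly independent.
    """
    row = _entropy_term(k11 + k12, k21 + k22)
    col = _entropy_term(k11 + k21, k12 + k22)
    mat = _entropy_term(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))
```

Under the usual large-sample approximation, this statistic is compared against a chi-squared distribution with one degree of freedom. Because it is built from raw counts rather than a normal approximation, it remains usable when some cells are small, which is the situation the abstract highlights for rare events.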

Similar resources

Coupled coincidence point in ordered cone metric spaces with examples in game theory

In this paper, we prove some coupled coincidence point theorems for mappings with the mixed monotone property and obtain the uniqueness of this coincidence point. We then provide useful examples in Nash equilibrium.


New Method of Quality Control Test for Light and Radiation Field Coincidence in Medical Linear Accelerators

Introduction: The evaluation of X-ray and light field coincidence in linear accelerators as a quality control test is often performed subjectively, involving the manual marking of films and their visual inspection following the irradiation. Therefore, the present study aimed to develop an objective method for the performance of this test leading to the increased levels...


Random coincidence point results for weakly increasing functions in partially ordered metric spaces

The aim of this paper is to establish random coincidence point results for weakly increasing random operators in the setting of ordered metric spaces by using generalized altering distance functions. Our results present random versions and extensions of some well-known results in the current literature.


Tripled coincidence point under ϕ-contractions in ordered $G_b$-metric spaces

In this paper, tripled coincidence points of mappings satisfying $\psi$-contractive conditions in the framework of partially ordered $G_b$-metric spaces are obtained. Our results extend the results of Aydi et al. [H. Aydi, E. Karapinar and W. Shatanawi, Tripled fixed point results in generalized metric space, J. Applied Math., Volume 2012, Article ID 314279, 10 pages]. Moreover, some examples o...


New electrical device for field coincidence quality control test in linear accelerators

Testing the coincidence of a linear accelerator's X-ray field and light field as a quality control test is often done subjectively, involving the manual marking of pieces of film and the visual inspection of the film after irradiation. The purpose of this study was to develop an objective method for performing this test, while also increasing the accuracy, precision and the spee...



Journal:
  • Computational Linguistics

Volume 19

Publication date: 1993